Compression and Visualization of High-Dimensionality Data Using Auto-Associative Neural Networks

نویسندگان

  • Zalhan Mohd Zin
  • Marzuki Khalid
  • Ehsan Mesbahi
  • Rubiyah Yusof
چکیده

Interpreting the information hidden in multidimensional data can be considered as a challenging and also a complicated task. The compression, dimension reduction and visualization of these multidimensional data provide ways to better understanding and interpretation of the problem. Usually, dimension reduction or compression is considered as the first step to data analysis and exploration. Here, the focus is given on multidimensional data reduction using a supervised artificial neural networks technique namely the Auto-Associative Neural Networks (AANN). The AANN can be considered as very powerful tool in exploratory data analysis. It has the ability to deal with linear and nonlinear correlation among variables. This technique is often referred to as nonlinear principal component analysis (NLPCA) or sometimes is also known as the bottleneck neural network due to its specific structure that consists of a combination of two networks – compression and decompression. By using this structure, AANN can reduce high dimensional data onto lower dimensional data by compressing them on its bottleneck layer that later can be used for data visualization and interpretation. In this paper, the technique of AANN is described, developed and applied on two different multidimensional datasets. The results have shown that the AANN is able to compress multidimensional data into only two nonlinear principal components at its bottleneck layer and these compressed data can provide visualization of different clusters of data. Keywords—Auto-Associative Neural Networks, Dimension Reduction, Data Clustering, Iris flowers, Olive oils

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Self-Reconstructing Algorithm for Single and Multiple-Sensor Fault Isolation Based on Auto-Associative Neural Networks

Recently different approaches have been developed in the field of sensor fault diagnostics based on Auto-Associative Neural Network (AANN). In this paper we present a novel algorithm called Self reconstructing Auto-Associative Neural Network (S-AANN) which is able to detect and isolate single faulty sensor via reconstruction. We have also extended the algorithm to be applicable in multiple faul...

متن کامل

Rapid authentication of animal cell lines using pyrolysis mass spectrometry and auto-associative artificial neural networks.

Pyrolysis mass spectrometry (PyMS) was used to produce biochemical fingerprints from replicate frozen cell cultures of mouse macrophage hybridoma 2C11-12, human leukaemia K562, baby hamster kidney BHK 21/C13, and mouse tumour BW-O, and a fresh culture of Chinese hamster ovary CHO cells. The dimensionality of these data was reduced by the unsupervised feature extraction pattern recognition techn...

متن کامل

Dimensionality reduction techniques for multivariate data classification, interactive visualization, and analysis-systematic feature selection vs. extraction

The curse of dimensionality, i.e., the fact that feature spaces of increasing dimensionality with finite sample sizes tend to be empty, has given incentive to a plethora of research activities in various disciplines and diverse application fields, e.g., statistics or neural networks. Three major application fields are multivariate data classification, data analysis, and data visualization. In t...

متن کامل

Estimating Missing Data Using Neural Network Techniques, Principal Component Analysis and Genetic Algorithms

The common problem of missing data in databases is being dealt with, in recent years, through estimation methods. Auto-associative neural networks combined with genetic algorithms have proved to be a successful approach to missing data imputation. Similarly, two new auto-associative models are developed to be used along with the Genetic Algorithm to estimate missing data and these approaches ar...

متن کامل

The Diagnosis of Brucellosis in Rafsanjan City Using Deep Auto-Encoder Neural Networks

Introduction: Brucellosis is considered as one of the most important common infectious diseases between humans and animals. Considering the endemic nature of brucellosis and the existence of numerous reports of human and animal cases of brucellosis in Iran, the incidence of human brucellosis in Rafsanjan city was determined in the last 3 years (2016–2018). The main objective of this study was t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012